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The neighboring optimal feedback control law is developed for sys- 
tems with a piecewise linear control for the case where the optimal 
control is obtained by nonlinear programming techniques. To de- 
velop the control perturbation for a given deviation from the nomi- 
nal path, the second variation is minimized subject to the constraint 
that the final conditions be satisfied (neighboring suboptimal con- 
trol). This process leads to a feedback relationship between the 
control perturbation and the measured deviation from the nomi- 
nal state. Neighboring suboptimal control is applied to the lunar 
launch problem. Two approaches, single optimization and multiple 
optimization, for calculating the gains are used, and the gains are 
tested in a guidance simulation with a mismatch in the acceleration 
of gravity. Both approaches give acceptable results, but multiple 
optimization keeps the perturbed path closer to the nominal path. 

INTRODUCTION 

In order to develop the neighboring optimal guidance law for a dynamical system, 
it is first necessary to obtain the optimal control. Currently, most trajectory opti- 
mization (see Ref. 1, for example) is accomplished by restricting the class of control 
functions to some subclass, say piecewise linear functions (suboptimal control). Then, 
the control parameters are the nodes of a piecewise linear function, and the su op- 
timal control is found by applying nonlinear programming methods. The subject of 
this paper is neighboring optimal control for systems with piecewise linear controls, 
or neighboring suboptimal control, and its application to vehicle guidance. 

In Refs. 2 and 3, the neighboring suboptimal control problem is formulated as a 
free final time problem and applied to the lunar launch problem. This formulation 
requires an iteration at each sample point to find the normalized time. In this paper, 
neighboring suboptimal control is formulated as a fixed final time problem and applied 
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to the lunar launch problem. While this problem is a minimum time problem, it can 
be converted to a “fixed final time” problem by using the horizontal component of 
velocity, whose final value is fixed, as the variable of integration. 

Two approaches for computing the control gains are presented. In the single 
optimization approach, the nominal suboptimal control is viewed as a sequence of 
reduced-node suboptimal controls to the final constraint manifold. Hence, the quality 
of the suboptimal control diminishes along the flight path. In the multiple optimiza- 
tion approach, a new full-node suboptimal control is computed from each node of the 
nominal suboptimal trajectory to the final constraint manifold. Hence, the quality of 
the suboptimal control along the flight path is maintained. 

After the suboptimal control problem and the neighboring suboptimal control 
problem are summarized, the lunar launch problem is defined. Then, the single 
optimization and multiple optimization approaches are used to compute the gains 
which are, in turn, tested in a simulation with a mismatch in the acceleration of 
gravity. Finally, some conclusions are reached about the use of these two approaches. 

SUBOPTIMAL CONTROL PROBLEM 


The fixed final time optimal control problem being considered here is to find the 
control history u(t) which minimizes the performance index 


J = «£(x/) 

(1) 

subject to the state differential equations 
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( 2 ) 

the prescribed initial conditions 
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( 3 ) 

and the prescribed final conditions 


Tj = 1, rl>{x f ) = 0. 

( 4 ) 


Here, the time has been normalized by the final time, that is, r - t/tf. This op- 
timal control problem is converted into a suboptimal control problem (parameter 
optimization problem) by assuming that controls are piecewise linear, meaning that 
the unknowns become the nodes of the linear control segments. 

If a denotes the unknown parameter vector which for one control is written as 
a T = [uj, u 2 , . . . , u r j, the differential equation (2) and its boundary conditions can be 

rewritten as , 

ax , x 
— =^(r l x,a) 

T 0 = T 0j , X 0 - X 0j , Tj = 1. 
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( 5 ) 

( 6 ) 



Given a, these equations can be integrated to obtain xj — xj(a) so that J 
<f>[xf(a)] = F(a) and tf>[x f (a)] = C{a ) Then, the suboptimal control problem is to 
find the parameter vector a which minimizes the performance index J = F(a ) sub- 
ject to the constraint C(a) = 0. 

To solve the suboptimal control problem analytically, the augmented performance 
index J' = F(a) -f v T C(a) = G(a, u) is formed. The first variation conditions are 
G a — 0 and C = 0 which determine a and v. The second variation becomes 8 J = 
8a T G aa Sa > 0 where CM = 0. 8a can be divided into dependent and independent 
parts; the dependent parts can be eliminated; and the second variation condition 
becomes the positive definiteness of a matrix. 

At this point, it is assumed that the suboptimal control problem is solved by using 
a nonlinear programming code (see Ref. 1, for example), and the next step is to find 
the neighboring suboptimal control. 

NEIGHBORING SUBOPTIMAL CONTROL 

The solution of the suboptimal control problem gives nominal control and state 
histories to be followed by the vehicle. However, because of modeling errors, the 
vehicle deviates from the nominal state. Hence, it is desired to find the neighboring 
suboptimal control perturbation which enables the vehicle to operate in the neigh- 
borhood of the nominal trajectory. The general philosophy is to find the control 
perturbation which minimizes the increase in the performance index while satisfying 
the prescribed final conditions. 

Since the first variation vanishes along the suboptimal path, the increase in the 
performance index is the second variation 

A J = l -8a T G aa 8a 00 

where the second derivative matrix G aa can be computed numerically. The elements of 
8a are not independent but are constrained by the need to satisfy the final conditions 

8xp = xp X/ 6x f = 0. 

The variation of the state equation (5) leads to the differential equation 

8x = g x 8x + g a 8a 

dr 

which must be solved subject to the boundary conditions 

T 0 = T 0j , 8x 0 = 6x 0 , 

Tf = 1, = 0- 

Next, the solution of Eq. (9) is assumed to have the transition matrix form 

8x = <t>8x j -f 'Ma 


( 8 ) 

( 9 ) 

( 10 ) 

(H) 
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where 


$, = /, ^=0 . ( l2 ) 

to guarantee that Sx } = Sxj. Then, substituting Eq. (11) into Eq. (9) and equating 
like coefficients leads to the differential equations 

vP' = g x ^ + 9a ( 13 ) 

= g x $ 

which must be solved subject to the boundary conditions (12). Once 

been obtained, Eq. (11) can be used. 

To satisfy the final condition (10), Eq. (11) is evaluated at r 0 and 

6x f = $ 0^*0 - V'Ma 

Then, Eq. (10) leads to 

‘ip Xf <&Q 1 '$of>a - = 0 ( 15 ) 

which is the constraint on the control node perturbation, 6a, imposed by the final 
condition. 

The last step is to minimize A J as given by Eq. (7) with respect to 6a subject to 
the constraint (15). Standard parameter optimization methods lead to 

6a = K 0 6x o ( 16 ) 


$ and have 
rewritten as 

( 14 ) 


where the gain Ko is given by 


$ 




x f 


% 


-1 


( 17 ) 


The computation of the gains can be checked by observing that K 0 = da . /dx Q 
and using numerical differentiation. Given a suboptimal control and state history, 
a perturbation in the state is introduced at some node, and the suboptimal control 
from that perturbed state to the final constraint manifold is computed. The gams 
are computed as K 0 (i,j) = Aa(*)/A*o(j) where Aa is the change in the suboptimal 

control caused by the change in the state. 

The application of neighboring suboptimal control as a guidance law is discussed 
in terms of the lunar launch problem which is defined in the next section. 
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LUNAR LAUNCH PROBLEM 


The lunar launch problem is to insert a payload in circular lunar orbit over a 
flat moon using a rocket with constant thrust acceleration. While this is a free final 
time problem, it can be converted to a “fixed final time” problem by cho OSing the 
horizontal component of velocity as the variable of integration. With the variable 
of integration normalized as « = (« - «o)/(«/ - «o), the optimal control problem 
is stated as follows: Find the thrust inclination history 0(«) which minimizes the 
performance index ^ g^ 


subject to the equations of motion 


dt 

{uj - u 0 ) 

(19) 

du 

a cos 0 


dy_ 

(u/ - u 0 )v 

(20) 

du 

a cos 0 


dv 

(u f - Uo)(asin0 - g) 

(21) 

du 

~ a cos 0 



and the boundary conditions 

So = 0, t 0 = 0, yo = 0, v 0 = 0 , 


( 22 ) 


u f = 1, y f = 50,000 ft , v/ = 0 ft/ sec. 


(23) 


In these equations, a = 20.8 ft/sec 2 is the thrust acceleration, g - 
acceleration of gravity, u, = 5444 ft/sec is the satellite speed, and u 0 - 0 ft/sec. 

For a piecewise linear control involving nine nodes, the nonlinear programming 
code VF02AD gives the following suboptimal control in degrees: 


0! = 26.01 0 2 = 23.3 1 0 3 = 20.51 

0 4 = 17.65 0 5 = 14.86 0 6 = H-90 ( 24 ) 

0 7 = 8.98 0 8 = 0.01 0g = 3.03 


Two approaches for applying neighboring suboptimal control are discussed: the 
single optimization approach and the multiple optimization approach Here, n 0 - 0 
for the single optimization approach or a node value for the multiple op ^ iza 
approach. In Ref. 4, neighboring suboptimal control results are presented for the cases 
where there is a thrust acceleration or a gravity modeling error. Only the gravi y 
case is discussed here because it has the largest errors. 
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SINGLE OPTIMIZATION APPROACH 

In this approach, the suboptimal control from node 1 to node 9 is considered to 
be a sequence of reduced-node suboptimal controls. In other words the suboptimal 
control from node 1 to node 9 is a nine-node suboptimal control. From node 2 to 
node 9, it is an eight-node suboptimal control; from node 3 to node 9, it is a seven- 
node suboptimal control; and so on. At node 8, there are only two nodes available, 
but these are enough to satisfy the boundary conditions (no optimization). Next, the 
9x3 gain matrix, K 0 in Eq. (17), is computed backward to each node and saved. The 
gains associated with the state t are all zero because there is no condition imposed 
on t f . Hence, the gain matrix, reduces to a 9 x 2 matrix, and the states are now 

°If thestate perturbation occurs at node 8, only Sa 8 is of interest for a sample and 
hold system. Hence, only the gains K 0 ( 8, 1) and A' 0 (8,2) are needed. Similarly, if the 
state perturbation occurs at node 7, only K 0 ( 7, 1) and K 0 ( 7, 2) are needed to compute 
Sa 7 , and so on. For a state perturbation between nodes, the gams are obtained y 
linearly interpolating the gains at adjacent nodes. To have gains over the last or 8th 
interval, the gains at nodes 7 and 8 are linearly extrapolated. In conclusion, only the 
gains K 0 (i, 1) and A 0 (i, 2) where i = 1, .... 9 need to be stored in the A>g ht computer. 

This approach to neighboring extremal control is tested by introducing T 
error in the acceleration of gravity. In other words, the true value of 5 is taken to be 
ip 5% different than the value being used in the computation of the gains, ams are 
computed and stored at every node or at every 0.125u for 9 nodes (Table 1)- The 
sample points are assumed to occur at every integration step of the simulation. Here, 
64 integration steps are used so that a sample point occurs every 0.015625u. The 
nominal states are obtained by numerical integration of the equations of motion sub- 
ject to the suboptimal control (24). The true states are obtained by integrating the 
equations of motion with the true acceleration of gravity subject to the neighboring 
suboptimal control. At each sample point, the true states and nomina states are 
differenced and the differences multiplied by the gains to obtain the control pertur- 
bation. The control perturbation is assumed constant over the sample period, u 
it is added to the piecewise-linear nominal control. Hence, the applied control varies 

linearly over the sample period. . , . . a 

The deviations between the true states and the desired values at the final point are 

presented in Table 2 along with the values which would have been obtained had e 
nominal control (24) been applied open loop. On a relative basis, the improvemen 
is substantial. However, a statement about the absolute quality of the closed-1 p 
results cannot be made without some performance criteria, say for example, that the 
vehicle has only so much AV to meet the desired final conditions precisely. 

Time histories of the deviations are shown in Fig. 1. Throughout the trajectory, 
the deviations are small, but they do not go to zero at the end. There are two 
possible reasons for this: (a) the quality of the suboptimal trajectory as the vehicle 
moves along its path and (b) the size of the last interval over which the gains are 
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Table 1 


9-NODE SINGLE OPTIMIZATION GAINS 


Node 

y Gain 

v Gain 

1 

-0.369E-5 

-0.673E-3 

2 

-0.289E-5 

-0.462E-3 

3 

-0.385E-5 

-0.521E-3 

4 

-0.573E-5 

-0.640E-3 

5 

-0.940E-5 

-0.831E-3 

6 

-0.179E-4 

-0.118E-2 

7 

-0.461E-4 

-0.201E-2 

8 

-0.267E-3 

-0.581E-2 

9 

-0.488E-3 

-0.961E-2 


Both of t ihese concerns can be addressed by increasing the number of nodes Hence 
the computations have been repeated for 17 nodes. The final point ^.ations « 
presented in Table 2 and show considerable improvement relative to those of 9 no . 
However, the deviation histories do not change appreciably relative to Fig. 1. 


Table 2 


DEVIATION FROM DESIRED FINAL CONDITIONS 


% Change 
in g 

State 

Open Loop 

-5.0 

yj 

9891.024 


Vi 

72.540 

+5.0 

yj 

-9891.023 


Vj 

-72.540 


Closed Loop Closed Loop 
9 Node 17 Node 

Single Opt. Single Opt. 
65.178 20.959 

-2.705 -1.977 

-63.989 -19.917 

2.616 1-832 


Closed Loop 
9 Node 
Mult. Opt. 
48.137 
-1.566 
-47.891 
1.542 
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Figure 1: 9-Node Single Optimization Deviation Histories 


MULTIPLE OPTIMIZATION APPROACH 

In an attempt to improve just the quality of the neighboring suboptimal control, a 
9-node suboptimal control to the final constraint manifold is computed from each no e 
of the nominal trajectory (Fig. 2), and the gains are computed for each subtrajectory 
bv Eq. (17). These gains are presented in Table 3 and are seen to be larger than those 
of the single optimization approach and uniformly increasing towar t e na poin . 
The use of these gains in the simulation with a q=5% mismatch in the acceleration of 
gravity leads to the final results of Table 2. These closed-loop results are somewhat 
better than those of the single optimization results for 9 nodes. 

The time histories of the deviations are shown in Fig. 3. Overall these deviations 
axe smaller than those of single optimization. Again, the fact that the deviations o 
not go to zero can probably be attributed to the extrapolation of the gams at nodes 
7 and 8 over the last interval. 


DISCUSSION AND CONCLUSIONS 

Two approaches for computing the gains for the neighboring suboptimal control 
guidance law have been tested in a simulation of a lunar launch vehicle: the single 
optimization approach and the multiple optimization approach. In both approaches, 
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Figure 2: Multiple Optimization Approach 


Table 3 

9-NODE MULTIPLE OPTIMIZATION GAINS 


Node 

y Gain 

v Gain 

1 

-0.369E-5 

-0.673E-3 

2 

-0.494E-5 

-0.780E-3 

3 

-0.688E-5 

-0.921E-3 

4 

-0.101E-4 

-0.112E-2 

5 

-0.161E-4 

-0.141E-2 

6 

-0.290E-4 

-0.190 E-2 

7 

-0.661E-4 

-0.288E-2 

8 

-0.267E-3 

-0.581E-2 

9 

-0.468E-3 

-0.874E-2 
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Figure 3: 9-Node Multiple Optimization Deviation Histories 

a suboptimal control and trajectory with evenly spaced nodes is used as a base, and 
the number of gains which must be stored is very small. 

For single optimization, that part of the suboptimal trajectory from a generic 
node to the final constraint manifold is thought of as a reduced-node suboptimal 
trajectory. Hence, the control becomes less optimal (fewer nodes) toward the end of 
the trajectory and eventually runs out of nodes for satisfying the boundary conditions. 
However, the gains generated by this approach produce good results in a guidance 
simulation. The final point results can be improved by increasing the number of 

The multiple optimization approach is to find a full-node suboptimal control from 
each node of the nominal path to the final constraint manifold. Gains generated from 
these subtrajectories are larger than those of the single optimization approach, are 
uniformly increasing toward the final point, and produce better guidance results, that 

is, the deviations are smaller along the path. 

From these results, it is apparent that the single optimization approach can satis- 
factorily meet the final conditions. On the other hand, if the perturbed trajectory is 
to lie close to the nominal trajectory, the quality of the optimization along the path 
must be improved. Multiple optimization does this, but the amount of computation 
is considerably more than that of single optimization. 
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